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Abstract. A single physical interaction might not be universal for quantum 
computation in general. It has been shown, however, that in some cases it 
can achieve universal quantum computation over a subspace. For example, 
by encoding logical qubits into arrays of multiple physical qubits, a single 
isotropic or anisotropic exchange interaction can generate a universal logi- 
cal gate-set. Recently, encoded universality for the exchange interaction was 
explicitly demonstrated on three-qubit arrays, the smallest nontrivial encod- 
ing. We now present the exact specification of a discrete universal logical 
gate-set on four-qubit arrays. We show how to implement the single qubit 
operations exactly with at most 3 nearest neighbor exchange operations and 
how to generate the encoded controlled-NOT with 27 parallel nearest neighbor 
exchange interactions or 50 serial gates, obtained from extensive numerical op- 
timization using genetic algorithms and Nelder-Mead searches. We also give 
gate-switching times for the three-qubit encoding to much higher accuracy 
than previously and provide the full specification for exact CNOT for this 
encoding. Our gate-sequences are immediately applicable to implementations 
of quantum circuits with the exchange interaction. 



1. Introduction 

To implement universal computation in the quantum regime, one must be able 
to generate any unitary transformation on the logical qubit states. By now it has 
become part of the quantum computation folklore that the group SU (2) of single- 
qubit operations and an entangling two-qubit operation such as the controlled-NOT 
{CNOT) can generate any unitary transformation exactly ^ EJ- Furthermore 
it has been shown that there are discrete universal elementary gate-sets which 
approximate any unitary transformation with arbitrary precision efficiently 1 (see 
00] for details). One such set is comprised of {H, ^, CNOT} [5], where H is 
the Hadamard transform and ^ is a phase gate, both acting on a single qubit. In 
this sense, H, ~ and CNOT comprise a quantum analogue to a classical universal 
logical gate-set. 



Date: February 1, 2008. 

^We use efficient in the computational sense, meaning that we can implement the transforma- 
tion with a number of elementary gates polynomial in the number of qubits. Note that not all 
general unitary transformations can be implemented efficiently; in fact the generic unitary trans- 
formation on n qubits requires an exponential amount of elementary gates. Our usage of efficient 
here means that given there is a sequence of one- and two-qubit gates that generates U then we 
can approximate this U to arbitrary accuracy with a sequence of gates drawn from our elementary 
set and such that we only have polynomial overhead in the number of gates used. Further, to 
double the precision we only need a constant amount of additional gates. This is the notion we 
need to define efficient computation. 
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The traditional paradigm of quantum computation of "one physical qubit = one 
logical qubit" is often hard to implement because in the presently known menu of 
physical implementation schemes, it is usually difficult to control at least one of 
either the single-body or the two-body operations jo]. 

A prime example is the Heisenberg interaction (with Hamiltonian H 1 ^ = Jij Si ® 
Sj between spin particles i and j, where Si = \<j % and cr l x yz are the usual Pauli 
matrices acting on qubit i). It has many attractive features [3 IE] that have led 
to its being chosen as the fundamental two-qubit interaction in a large number 
of recent proposals: Its functional form is very accurate — deviations from the 
isotropic form of the interaction, arising only from relativistic corrections, can be 
very small in suitably chosen systems. It is a strong interaction, so that it should 
permit very fast gate operation, well into the GHz range for several of the propos- 
als. At the same time, it is very short ranged, arising from the spatial overlap of 
electronic wavefunctions, so that it should be possible to have an on-off ratio of 
many orders of magnitude. We will assume that the interaction can be switched 
on and off between coupled qubits [Hj- Unfortunately, the Heisenberg interaction 
by itself is not a universal gate, in the sense that it cannot generate any arbitrary 
unitary transformation on a collection of spin-1/2 qubits. So, every proposal has 
supplemented the Heisenberg interaction with some other means of applying inde- 
pendent one-qubit gates (which can be thought of as time-dependent local magnetic 
fields). But the need to add this capability to the device adds considerably to the 
complexity of the structures, by putting unprecedented demands on "g-factor" en- 
gineering of heterostructure materials [§l ll()| . requiring that strong, inhomogeneous 
magnetic fields be applied, or involving microwave manipulations of the spins that 
may be slow and may cause heating of the device. These added complexities may 
well exact a high cost, perhaps degrading the quantum coherence and clock rate of 
these devices by several orders of magnitude. 

Encoded universality provides a way around this problem in some crucial 
cases, for example when the "easy" interaction is the exchange interaction, by 
entirely eliminating the need for single-body physical operations. By encoding 
each logical qubit in an array of multiple physical qubits, sequences of two-body 
nearest-neighbor exchange interactions are sufficient to generate the logical SU (2) 
and CNOT operations 2 on the encoded qubits |14llllllTK] and single-spin operations 
and all their attendant difficulties can be avoided. 

One drawback of the theory of encoded universality ^1] is that it establishes the 
sufficiency of certain two-body interactions for universality in a non-constructive 
way, not offering explicit methods with which to specify the sequences of physical 
implcmcntable Hamiltonians corresponding to the encoded logical gates. In par- 
ticular it is not clear at the outset how many physical interactions are required to 
implement each of the logical gates in some layout of the qubits. Encoded computa- 
tion schemes are only viable if the number of physical interactions to be applied to 



Note that it has been shown that a generic two-qubit interaction alone generates universal 
computation 1 1 21 1 1. 'il . However, by an irony of nature most implementable interactions in current 
quantum computation schemes happen to fall in the set of exceptions to this. These exceptions 
include the ubiquitous exchange interaction (both isotropic and anisotropic) and several other 
interactions that exhibit a certain amount of symmetry, which makes them non-generic in the 
above sense. Even for interactions that fall into the category of being universal by themselves, 
explicit gate-constructions have to be found in a case by case basis. 
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the qubits is not too large, where the threshold is determined by currently achiev- 
able decoherence and switching times. In most cases, numerical methods are the 
only way to find explicit sequences of Hamiltonians for a set of universal gates for 
some realistic arrangement of the physical qubits. Recently, more or less explicit 
universal logical gate-sets have been given for a three-qubit encoding using only 
the exchange interaction |16| . for the XY- interaction |Hj and for the generalized 
anisotropic exchange Hamiltonian |17| . In |lf>| an initial encoding of three physical 
qubits per logical qubit is used and a sequence of 19 Hamiltonians is presented 
that implements the encoded CNOT. However this CNOT is given up to local 
unitary operations only, and the encoded single-qubit operations are given in terms 
of Euler-angle rotations for the group SU(2). Some further processing is needed 
to obtain a universal discrete gate-set, needed to implement quantum circuits in 
terms of the computational basis |18j . 

We present here a complete scheme for universal quantum computation on four- 
qubit encodings in a one- (or two-)dimensional layout with nearest neighbor inter- 
actions only. We specify the encoding and layout and give all the gate switching 
times to obtain the encoded H, ^ and CNOT in the computational basis without 
further post-processing. This scheme provides an immediately applicable building 
block for exchange-only quantum circuits. We also provide new gate sequences for 
the CNOT in the three-qubit encoding to higher precision and with different sym- 
metries than in and provide the complete set of gates for the exact encoded 
CNOT in this smaller encoding. Although the four-qubit encoding has a slightly 
larger overhead in spatial resources than the three-qubit encoding it offers several 
advantages. A quantum computation begins by setting all encoded qubits to the 
(logical) zero state. In our scheme this state is a tensor product of singlet states. 
This state is easily obtained using the exchange interaction: if a strong H\i is 
turned on in each coded block and the temperature made lower than the strength 
J of the interaction, these two spins will equilibrate to their ground state, which is 
the singlet state. Unlike the smaller three-qubit encoding we do not require here 
any additional weak magnetic fields for initialization. This aspect renders the four- 
qubit encoding particularly attractive. Another advantage is that the four-qubit 
scheme is conceptually simpler for use in quantum logic when the properties of 
robustness to noise are also taken into account. Whereas the four-qubit logical 
states constitute a decoherence free subspace (DFS) under collective decoherence 
[Tl| , the three-qubit logical states constitute a decoherence free subsystem in which 
the logical state evolution is defined by only one component of the tensor product 
space. A third advantage of the four-qubit encoding is that additional protection 
against single qubit errors can be achieved in this case by control of extra exchange 
interactions to form a supercoherent qubit |19|. 

The structure of this paper is as follows. First, we describe the four-qubit en- 
codings which define our logical space and give the physical layout of the qubits. 
We then specify the Hamiltonians required to generate the single-qubit operations 
and the CNOT gate on the encoded qubits. The numerical methods used to ob- 
tain the exchange gate sequence for the encoded CNOT and further details are 
described in Appendices lAl and IbI The same numerical methods are used to obtain 
new high accuracy gate sequences for the exact CNOT in the three-qubit encod- 
ing. We conclude with a brief discussion of a solid-state implementation scheme 
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in which these results can be readily applied, and some open problems meriting 
further consideration. 

2. FOUR-QUBIT ENCODING 

We define the logical zero-state and one-state on one encoded qubit as 

(1) |0 i )=i(|01)-|10))<8(|01)-|10)) 

(2) = (\t+) ® \t-) - |*o) ® |*o) + |*-) <8 \t+)) , 

where |i ) = ^ (|01) + |10», = |00>, and = |11). For a more detailed 
discussion on how to obtain these encodings, refer to [111 I15| . To initialize a com- 
putation all logical qubits have to prepared in the \0l) state. Note that here the 
|0l) state is a tensor product of singlets 4=(|01) — |1Q)). As detailed in the previous 
section this will be advantageous in many experimental settings since it will permit 
easy initializion of the logical qubits at the beginning of a computation. 

The arrays are spatially configured to permit serial nearest-neighbor exchanges 
between the physical qubits in a one-dimensional layout: 

1 2 3 4 5 6 7 8 



encoded qubit 1 encoded qubit 2 

Figure 1 . Serial configuration of eight physical qubits comprising 
a system of two encoded qubits. 

We could also imagine these qubit arrays in a two-dimensional layout, where 
several of the one-dimensional layers are stacked on top of each other. Our con- 
struction of gate sequences will also hold for the two-dimensional setting. It suffices 
to note that the only difference is that along the layers of arrays the fourth qubit of 
each array is coupled to the first qubit of the next, whereas between layers the first 
qubit of one array in one layer is interacting with the first qubit of an array in the 
other. But note that both encoded states \0l), are symmetric with respect to 
swapping qubit 1 with qubit 4 and qubit 2 with qubit 3, so we obtain exactly the 
same gate-sequence for CNOT for a coupling of the two first qubits of an array - 
all we need to do is to relabel the qubits on the bottom array as 4, 3, 2, 1. 

The basis states of the logical space defined by two encoded qubits are \0l0l), 
|0lU>, |1l0l> and \l L l L ). 

3. Single- Qubit Operations 
Our goal is to construct the single-qubit Hadamard H and ? gates, defined as 

(X\ ^ - P «*/8 ( e_W8 ^ H - 1 f 1 M 

(3) 8 E { eW8 )> H -rf\l 1 J' 
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(4) E^ +1 = \(al®al +1 +al®al +1 +al®al +1 +P®P +1 ) = 



on the encoded qubits, using a sequence of exchange interactions H l £ +1 = (er^cr^" 1 " 
a y a y +1 + a l z a l z +1 ) on adjacent pairs of physical qubits i and i + 1. The matrices 
<7* y z are the usual Pauli matrices acting on the ith qubit and Ji,i+i is the cou- 
pling constant. When we write H^ +1 we assume that the Hamiltonian acts on the 
ith and i + 1st qubit as specified and as the identity on all the other qubits. For 
convenience we are going to add a multiple of the identity to H E (which just gives 
an unobservable global phase) , and work with the rescaled interaction 

/ 1 \ 
10 
10 
\ 1 j 

so that exp (—{Jij /h)tSi ■ Sj) = exp (—i{Jij /2h)tE 1 '^ (up to a global phase). We 
will give all exchange times in units of 2fi/ Jy |7j. 

Consider the effect of two particular exchanges on the logical states of a single 
encoded qubit. First, we note that in the code-subspace the action of the exchange 
E 1 ' 2 is equal to that of E 3 ' 4 , with both of these generating the transformation 
|0l) — > — |0t) and — ► |1l)- So -E 1 ' 2 is equivalent to a a z operation on a single 
logical qubit. Therefore the Hamiltonian for the encoded ? operation, up to an 
unobservable global phase, is exactly e't^ 1,2 . 

Next, consider the exchange E 23 . The action of this in the code-space is equiv- 
alent to E 1 ' 4 . The effect of the E23 operation on a single logical qubit is: 

(5) E 2 ' 3 = 

By examining the effect of the E ' and E 2,3 operations on the code-space, we 
can obtain an exact encoded Hadamard operation as H = e ltlE ' e it2E ' e ltlE ' , 

where t\ = \ arcsin = 0.4777 and t% = arccos = 0.9553. 

The exact specifications of exchange operations for these single qubit gates are 
depicted in Fig. 2. 

4. Encoded CNOT operation 

To obtain the encoded CNOT we used numerical methods and proceeded in two 
stages. In the first step we attempted to obtain a gate U^^ an9e that is equivalent 
to the encoded CNOT up to local unitary transformations on the encoded qubits. 
Ucnot an9e 1S locally equivalent to CNOT if there are single-qubit unitary operations 
Ui, U2, Vi, Vi (acting on the first and second encoded qubit, respectively) such that 




(6) CNOT = (Ui (8 U 2 )U e c Zt 9e {Vi <8> V 2 ) 

In the second stage we numerically obtained the local unitary operations U\, U2, Vi, V2 
to get from U^^ an9e to the real CNOT in the computational basis and to obtain 
the gate-sequences of exchange interactions corresponding to each of these local 
operations Ui, Vi, i = 1,2. The reason for splitting the task into these two stages is 
the following. A result by Makhlin |20I shows that all locally equivalent gates are 
characterized by only three real parameters, Mi (a complex number) and M 2 (real) , 
which we will refer to as the Makhlin-invariants in what follows. Appendix lAl gives 
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Figure 2. a) One exchange interaction is sufficient to generate 
the encoded ^ gate, b) three nearest neighbor exchanges allow 
to generate the encoded Hadamard gate. The t values are the 
time parameters corresponding to the individual exchanges with 
t = tt/8 = 0.3927, ti = 0.4777 and t 2 = 0.9553. All times are given 
in units of 2h/Jij. 



a brief summary of how Mi and M 2 are calculated. For the CNOT, Mi = and 
Mi = 1. The reduction to three parameters greatly simplifies the numerical search 
and allowed us to obtain the gate sequences by a combination of genetic algorithms 
[2"T] and Nelder-Mead simplex searches [221 I2H] ■ The Makhlin invariants give rise 
to a simple fitness function / = \\M x {CNOT) - M X (C andidate)\\ + \\M 2 {CNOT) - 
M2(Candidate)\\, where ||.|| is the complex norm. A gate sequence that generates 
a value / = is therefore equivalent to a CNOT. 

It is imperative for succesful quantum computation over a subspace that any 
permissible operation over the encoded qubits must act unitarily on linear com- 
binations of these basis states and not "leak" any amplitude out of the subspace 
into its complement and vice versa. We will capture this requirement by defining a 
leakage parameter A. Then any permissible two-qubit physical operation W must 
keep the code-subspace invariant, i.e. obey the following equation 3 : 

1111 

(7) A = 4 - E E E K s Lt L \W\u L v L )\ 2 = 0. 

s=0 t=0 u=0 v=0 

If leakage occurs A > 0. We note that any W that is locally equivalent to CNOT 
or any other unitary logical operation over two encoded qubits must by definition 
generate a leakage parameter of A = 0. However, we have found through numerical 
inspection that the search space in this problem is heavily pocked with local minima 
in which the Makhlin invariants are close to the desired values for CNOT, but 
which "leak" out of the logical space by generating a A value larger than 0. In our 
numerical searches, we overcame this problem by defining our fitness function as T 
= f + A, optimizing explicitly for not only a Makhlin invariant match, but also for 
non-leakage. 



J Note that exchange operations within an array of 4 qubits keep the one-qubit code space 
invariant and do not leak. Leakage can only occur when we couple two arrays (see 1141 for 
details). 
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A detailed description of our search algorithms and of the accuracy of our gate 
sequence can be found in Appendix FBI 

For a gate U^^ an9e , locally equivalent to the CNOT, we found a gate sequence 
of 34 nearest neighbor exchange interactions. Figure 3 and Tabled show the layout 
and time parameters. 
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Figure 3. Gate sequence of 34 exchange interactions for the en- 
coded CNOT, with the corresponding time parameters, given in 
Table [B 



qubit 2 



qubit 1 



Exchange 


Qubit 


Qubit 




Exchange 


Qubit 


Qubit 




Time 


1 


2 


Times 


Time 


1 


2 


Times 


h 


4 


5 


1.90680 


tis 


7 


8 


0.95629 


<2 


3 


4 


1.59536 


£l9 


2 


3 


1.06260 


*3 


5 


6 


1.26290 


^20 


6 


7 


0.68131 


i 4 


2 


3 


1.59745 


«2l 


3 


4 


0.59800 


is 


6 


7 


2.06920 


£22 


5 


6 


1.19942 


U 


1 


2 


0.05331 


£23 


4 


5 


1.04719 


tr 


7 


8 


0.76951 


£24 


3 


4 


3.14138 


ts 


2 


3 


1.59747 


£25 


5 


6 


0.95529 


tg 


6 


7 


0.71337 


£26 


2 


3 


1.63957 


tio 


3 


4 


1.59958 


£27 


6 


7 


1.91303 


hi 


5 


6 


1.26287 


£28 


1 


2 


2.47920 


tu 


4 


5 


1.90667 


£29 


7 


8 


2.18627 


tia 


3 


4 


0.59810 


£30 


2 


3 


1.05736 




5 


6 


1.71467 


*31 


6 


7 


0.94814 


£15 


2 


3 


1.06264 


£32 


3 


4 


3.14170 


£l6 


6 


7 


0.91559 


£33 


5 


6 


4.09690 


tl7 


1 


2 


2.30240 


£34 


4 


5 


2.09434 



Table 1. Gate switching times for the sequence of 34 exchange 
interactions of Figure 3, given in units of 2H/ J. 
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Note that those exchanges that are on a vertical line in Figure 3 involve disjoint 
sets of qubits and can be applied in parallel, where each cycle of gates lasts as long 
as the longest switching time in the set of parallel gates. If we count the number 
of parallel operations, we obtain 19 gate cycles. 

In the second stage we searched for the encoded local unitary gates XJ\ , U2 , V\ , V2 
(see Eq. (jHJl) that transform the 34-gate sequence into an exact CNOT on the 
computational basis states, Eqs. Q and HI It has been shown previously HH] 
that each encoded local unitary can be obtained by a sequence of four exchange 
gates. We employ here the nearest neighbor layout as shown in Figure The 
constructions in |15U16| involve non-nearest neighbor interactions, with E13 instead 
of E23. It is easy to see, however, that replacing E13 with E23 in the arguments of 
|15l I16| also leads to a sequence of four exchanges. 
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Figure 4. Local unitaries can be generated using a sequence of 4 
exchange gates as shown. 

These 16 remaining gates (4 for each local unitary) can be obtained either as 
the result of numerical optimization of a suitable cost function, or from solving the 
system of non-linear equations derived from the clement-wise equivalency condition 
between the objective matrix and the product of the four exchange matrices. We 
employed the optimization approach using a Nelder-Mead simplex search because 
of its high efficiency and generality. A similar approach has been used in |18) . where 
the cost function (a combination of a matrix distance between the actual gate and 
the desired gate and a non-leakage requirement) and the details of our numerical 
search calculations can be found. To reduce the probability of sampling only local 
minima, we sampled a large number (5 million) of randomly selected initial points 
in parameter space. We were able to determine the four local unitaries and their 
corresponding 4-gate exchange sequences to a precision of 10 -4 in the cost function, 
with the corresponding maximum matrix element distance of the order of 1CP 5 . 

An alternative approach to finding gate sequences for local unitaries using the 
standard mapping from SU{2) to 5*0(3) and a quaternion representation of SO(3) 
can be found in Ref. |18) . 

Table |21 shows the gate times for each of the 4 exchange interactions required to 
implement the encoded local unitary operations. Note that the exchange interac- 
tions implementing U\ and U2 can be applied in parallel, as can those for V\ and V2. 
Thus, transforming the 34-gate sequence of Fig. 3 into the exact CNOT gate on 
the computational basis requires 16 additional nearest-neighbor interactions, that 
can be realized as 8 additional parallel gate cycles. 
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Table 2. The exchange gates and corresponding gate times (in 
units of 2H/J) required to transform W into the actual CNOT 
gate. 



The total number of nearest neighbor interactions for the exact CNOT amounts 
to 27 if applied in parallel and 50 if applied serially. 

5. Gate sequences for the three-qubit encoding 

A 19 exchange gate sequence for a gate locally equivalent to CNOT for the three 
qubit encoding has been given in [T2|. The layout in ^B] obeyed certain symmetry 
constraints. We have recalculated this sequence to a higher accuracy with and 
without these symmetry constraints and have also computed the exchange-only 
implementation of the local unitaries Ui,Vi, i = 1,2, needed to transform the 
19-gate sequence to the exact CNOT in the computational basis. The resulting 
sequences for exact CNOT in the three-qubit encoding are given in Appendix [U] 

6. Conclusion 

We have presented an exact construction of a discrete universal logical gate-set 
using only the exchange operation with a four-qubit encoding. These results are 
readily applicable to physical implementation schemes in which exchange interac- 
tions are favored. These include the classical solid-state nuclear-spin qubit model 
proposed by Kane O EH] and the electron-spin qubit proposal of Loss and Di- 
Vincenzo |3 |H]- For a four- fold increase in the number of system qubits and a 
twenty-nine-fold increase in the number of computational cycles for the two-qubit 
operation, we are able to simplify the implementation of spin coupled solid state 
systems by entirely eliminating the need for single-spin ^4-gates. In contrast to the 
rapid and relatively easily-tunable two-spin J-gates, the ^4-gates demand consider- 
ably greater device complexity and (/-factor engineering on solid-state heterostruc- 
tures OEj. 

Thus far, explicit constructions of universal logical gate-sets for exchange-only 
quantum circuits have been given on three-qubit encodings |16| and for the four- 
qubit scheme presented here. It should be noted that in principle the overhead in 
spatial resources can be made arbitrarily small: asymptotically the rate of encoding 
into subsystems converges to unity However we have to carefully evaluate the 
trade-offs in space and time for each encoding. So far no constructive analytical 
methods to lower bound the number of nearest neighbor interactions for encoded 
gates exist. Using numerical methods yields an increase from 19 to 34 gates for a 
gate equivalent to the encoded CNOT going from a three-qubit ^Hj to the present 
four-qubit encoding. This seems to indicate that the rate of growth of the number 
of nearest neighbor gates needed is rather large and that it is probably wise to stick 
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to small encodings if the error correcting properties arc not also to be incorporated. 
However, we note that these are all numerical solutions and are not guaranteed to 
be optimal. It would therefore be useful to obtain analytic bounds on the minimum 
number of exchange gates required for encoded operations. 

Another open problem is the application of encoded universality to other inter- 
actions encountered in nature and in the laboratory, to facilitate the search towards 
optimal physical schemes for implementation of universal quantum computation. 
We believe that the scheme presented here provides a step in this direction and al- 
leviates the task of the quantum engineer working towards spin-coupled solid state 
quantum computation. 
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Appendix A. The Makhlin Invariants 

We give a brief description of how to calculate the Makhlin invariants |20| for 
an encoded two-qubit operation W. These invariants characterize a two-qubit op- 
eration up to equivalence by local unitaries (see Eq. |JBJ). 

In a first step project the physical operator W onto the logical subspace: 

(8) M = P^U e c Zt an9e P; 

P is a 256-by-4 matrix whose column vectors are the basis states {|0l0l), \0l^-l), 
|1l0_l), and M is a matrix in SU(4). We next transform M into the 

"Bell-basis" as M B = Q^MQ, where 
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Finally, we define m = M^Mb, to obtain the invariants Mi = tr 2 (m)/16detM 
and Mi = (tr 2 (m) — tr(m 2 )) / AdetM . For gates that are locally equivalent to the 
CNOT, Mi = and M 2 = 1. 

Appendix B. Numerical search for a gate locally equivalent to 

CNOT 

To obtain a gate U^^ an9e which is locally equivalent to the encoded CNOT, 
we applied a combination of genetic algorithms and Nelder-Mead simplex searches. 
At the beginning of every search, we fixed a sequence of qubit pairs to be coupled 
with an exchange interaction, and optimized the fitness function T with respect 
to time parameters only. No restrictions on symmetry were imposed, unlike 
(see Appendix IO . We started with a small number of couplings and incremented 
the number of exchange interactions after each unsuccesful attempt to find a gate 
equivalent to the CNOT . The final layout of the exchanges is indicated in Figure 3. 
We found that space generated by T was sufficiently complex such that allowing the 
sequence of qubit-pairs to vary during the optimization only introduced unnecessary 
complications into the search. 

Even with the incorporation of the leakage parameter A into fitness function 
T = f + A, the large space of parameters is still marked with many local minima. 
Therefore, the first stage of our search was a genetic algorithm, whose heuristic is 
well-equipped to score large spaces aggressively in order to identify basins in which a 
global minimum may occur. Whereas algorithms based on the hill-descent heuristic 
often trap themselves into basins of local minima, genetic algorithms are able to 
traverse rapidly through regions of the space between the basins, enabling them to 
descend from one basin to another. The pseudocode for the genetic algorithm is as 
follows: 

Step 1: Initialization. Let the initial population consisting of 60 candidates be 
defined as the set Pt. Each member of Pi is a 34-dimensional real- valued vector 
whose elements lie in the interval [0, 2ii\. Each vector represents the genome of a 
candidate, and the j th element in each vector ( a gene) is the time parameter for 
the Hamiltonian in the j th exchange in a fixed sequence of qubit-pair exchanges. 
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Step 2: Fitness Evaluation and Selection. Generate the Makhlin invariants 
Mi , M2 and leakage parameter A for each candidate and rank the candidates ac- 
cording to their fitness scores T . Sort the top 20 performing candidates into the 
parental pool. 

Step 3: Crossover. Randomly pair the 20 members of the parental pool. Each 
parental pair generates two offspring. The first offspring is a random, pairwise 
convex combination of the genomes of the parental pair. Let a and (3 be random 
real variables in the interval [0, 1]. For parents u and v, we have: 

PARENT, = (7«,i,---,7«,3 4 ) 
PARENT, = (7„,i.---»7t>,34) 

(10) OFFSPRINGS = (a 7u ,i + (1 - a) 7 „,i , . . . , a 7u , 34 + (1 - a) lvM ) 

The second offspring is a random geometric average of the genomes of the parental 
pair. 

PARENT, = (7«,i,---,7«,34) 
PARENT, = (7„,i.---.7i;,34) 

(11) OFFSPRING2,,, = { lu ,if {^sf'^ , ■ ■ ■ Alu M f {lv M )^ p 

Intuitively, each candidate in the population represents a point on a simplex within 
the search space. By taking convex combinations and geometric averages between 
the points, we search the face planes of the simplex. 

Step 4: Insertion. We now construct the population of the next generation 
Pt+i- The new population consists of: 

(1) The top (20 + M) candidates from P t , where M is a randomly generated 
integer between and 20 

(2) The 20 offspring generated from the crossover step 

(3) (20 - M) new, randomly-generated candidates 

The purpose in inserting new candidates during each generation is to enable the 
search to extract itself from local minima. If the search simplex has converged to 
a local minimum, the new candidates will serve as vertex points that can pull the 
search into more promising regions within the space. 

Step 5: Mutation. We now subject the population to a random mutation 
process, where each gene (component of Pt) in each genome is perturbed to a 
new value within [0, 27r] with probability .03. It is necessary to introduce these 
mutations, corresponding to small steps in the search simplex, because the cross- 
over operations tend to pass over global minima too rapidly. However, even small 
perturbations in the genome cause increasingly violent movements in T as global 
minima are approached. So to balance these considerations, the top ten performers 
in each generation are exempted from mutations to stabilize the performance of the 
algorithm. 

Step 5': Exit Condition Check. Check if the top-ranked candidate satisfies the 
condition T < e for a sufficiently small e. If not, return to Step 2. 

We ran four simultaneous genetic algorithms with four distinct, randomly-generated 
populations of size 60, and coordinated the search by inserting a clone of the top 
candidate from the population with the best top-performer into the other three 
populations. After 2394 generations, we obtained a candidate with error magni- 
tudes of O(10~ 2 ) with respect to the Makhlin invariants and O(10 _1 ) with respect 
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to the leakage parameter. At this point, the pace of progress in the genetic algo- 
rithm slowed down dramatically, so we used the top performer as the starting point 
for the second stage of our algorithm, a Nelder-Mead simplex direct search. At 
this point, the simplex search was considerably more robust, because the simplex 
heuristic enables the simplex to flex and squeeze itself through narrow valleys of the 
space more sensitively. After 5296 iterations of the simplex search, we obtained a 
candidate with error magnitudes of O(10~ 6 ) with respect to the Makhlin invariants 
and O(10 -2 ) with respect to the leakage parameter. Since this was the first time we 
had advanced to such a low point in the space, we ran the genetic algorithm again 
to see if any further improvement could be obtained in this manner, and to acquire 
some further intuition about the structure of the space. After 471 generations, only 
a trivial improvement was obtained, so we returned to the simplex method once 
more. After 22081 iterations of the simplex search, we obtained error magnitudes 
of O(10 -10 ) with respect to the Makhlin invariants and O(10~ 8 ) with respect to 
the leakage parameter. 

Appendix C. Gate sequences for CNOT with the three-qubit 

ENCODING 

In Ref. [T2|, a numerical search was utilized to generate a CNOT gate from a se- 
quence of nearest-neighbor exchange interactions on a system of two computational 
qubits, each encoded by three physical spin-1/2 qubits. The three-qubit encoding 
is 

\Vl) = \\S)®\1) 

(12) |i i ) = y||r + )®|o)-i|T )®|i), 

where \S) = -j= (jlO) - |01», \T+) = 1 11) and \T Q ) = (|10) + |01)). The numeri- 
cal search was made by minimization of a similar fitness function to that employed 
here, i.e., including both matrix distance from CNOT and leakage penalty func- 
tions. A candidate sequence of 19 exchanges equivalent to CNOT up to local 
transformations on the encoded basis (U^^ an9e ) was obtained for a layout con- 
taining two simplifying symmetries in the exchange gate times. These symmetries 
are illustrated in Figure 5, with the gate switching times given in Table 3. Here 
we give also the exchange-only implementation of the local gates Ui,Vi, i = 1,2 
(cf. Eq. ©), needed to convert the 19-gate sequence into the exact CNOT in the 
computational basis. These were obtained using the same procedures as described 
in Section Hand Ref. [T5| . 

The first symmetry is that certain gate times are repeated in a spatially symmet- 
ric configuration, e.g., t± occurs at the beginning and end of the sequence between 
physical qubits 3 and 4 in each case. We denote this the repetition symmetry. A 
second symmetry apparent in Figure 5 is that certain sequential pairs of interac- 
tion times are related by analytic functions. In particular, for k — 5,6,8,10, the 
functions 

(13) c fc = tan(tfc)tan(£fc) + 2 

are exactly equal to zero. We denote this symmetry a correlation symmetry. With 
these symmetry restrictions the optimal solution for a 19 exchange gate sequence 
obtained in Ref ^jl yields a Makhlin invariant fitness value of / on the order of 
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qubit 2 



qubit 1 



Figure 5. Layout for the exact CNOT. The 19-gate sequence 
for a gate U^^ an9e locally equivalent to CNOT (£4 — tw, £5 — i\a) 
is taken from Ref. \n$\ (time indices are shifted by three). Gate 
switching times are given in Table 3. 
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Table 3. Gate switching times for the 26 gate sequence of Figure 
5, given in units of 2H/J. This corresponds to it times the time 
units employed in Ref. |16) . 



10~ 10 and a leakage parameter A on the order of 10~ 8 . The overall precision of 
the sequence, given by the maximum matrix element distance to the CNOT in the 
computational basis is of order 10~ 6 . 
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We have performed a new set of numerical searches for this same 19-exchange 
gate layout in the three-qubit encoding, using the techniques described in this paper 
and without imposing any symmetry constraints on the exchange times. We find 
that not only can higher quality numerical solutions be obtained, but also that 
these are very significantly improved when the simplifying symmetry constraints 
are lifted. This results in a larger set of independent variables (19 instead of the 
7 given in Ref. ^j) but with the advantage of a considerably smaller value of the 
cost function. 

We performed a search with a Nelder-Mead simplex algorithm whose optimiza- 
tion criterion was the minimization of the Makhlin fitness function / and the leakage 
parameter A, starting with a random initial set of exchange gate times. Figure 6 and 
Table 4 shows the corresponding exchange sequence, together with the exchange- 
only implementation of the local gates Ui,Vi, i = 1, 2 needed to convert the 19-gate 
sequence into the exact CNOT in the computational basis. This search yielded zero 
values of Makhlin fitness function / and leakage parameter A to within machine 
precision (10 -16 ), and a maximum matrix element distance from the exact CNOT 
of 1CP 9 . This provides a significant improvement over the corresponding overall 
precision of 10~ 6 for the original 19-gate sequence of Ref. 




Figure 6. Layout for the exact CNOT obtained by unrestricted 
optimization of times ti — £25 for U^ l ^ an9e and of t\ — t§ and 
^26 — £31 for the local unitaries Ui 7 Vi, i — 1,2. Gate switching 
times are given in Table 4. 

In the solution described in |16| . the correlation symmetries were satisfied to 
machine precision, while the repetition symmetries satisfied exactly. We note that 
these correlation and repetition symmetries are not essential to the task of imple- 
menting the encoded CNOT operation. From an optimization perspective, they 
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Table 4. Gate switching times for the 31 gate sequence of Figure 
6, given in units of 2h/ J. 



might even be interpreted as a hindrance that constrains the trajectory of numerical 
searches to lie in sub-optimal subspaces of the control parameter space. It is not 
clear whether the symmetries in the solution obtained in Ref. JS| suggest the exis- 
tence of analytical solutions to the optimization problem of the cost function, and 
if so whether these correspond to local or global minima. The above example shows 
that, without any symmetry restrictions and allowing the number of independent 
time parameters to increase, improvement to optimization of the cost function to 
within machine prevision can be obtained. 
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